Clustering Orthologous Protein Sequences thru Python Based Program
نویسندگان
چکیده
An alignment is an arrangement of two sequences which shows where the two sequences are similar, and where they differ. Here, in this paper, we report multiple alignments of few orthologous insulin sequences and construction of a python program based on average linkage clustering algorithm to generate clusters of significant similarities. Insulin sequences are extracted from uniprot knowledgebase and the multiple alignments are carried out using clustalw program. The resultant scores are subjected to clustering technique and the significant clusters are generated. Keywords— Clustering, ClustalW, Insulin, Multiple Alignments, Python
منابع مشابه
A Group Average Cluster Analysis of Few IGF1R Sequences using Modified Group Average Link Clustering Algorithm
Clustering techniques have been widely used in the fields of information technology, biomedical sciences. Cluster analysis deals with the identification of a set of objects into subsets with some sort of similarities. Such groups are assigned to have similar function. In this paper, a modified group average clustering program was written in python language and applied on a dataset of IGF1R prot...
متن کاملA Group Average Cluster Analysis of Few IGF1R Sequences using Modified Group Average Link Clustering Algorithm
Clustering techniques have been widely used in the fields of information technology, biomedical sciences. Cluster analysis deals with the identification of a set of objects into subsets with some sort of similarities. Such groups are assigned to have similar function. In this paper, a modified group average clustering program was written in python language and applied on a dataset of IGF1R prot...
متن کاملApplication of Subspace Clustering in DNA Sequence Analysis
Identification and clustering of orthologous genes plays an important role in developing evolutionary models such as validating convergent and divergent phylogeny and predicting functional proteins in newly sequenced species of unverified nucleotide protein mappings. Here, we introduce an application of subspace clustering as applied to orthologous gene sequences and discuss the initial results...
متن کاملAmplicon: software for designing PCR primers on aligned DNA sequences
SUMMARY Amplicon is a program for designing PCR primers on aligned groups of DNA sequences. The most important application for Amplicon is the design of 'group-specific' PCR primer sets that amplify a DNA region from a given taxonomic group but do not amplify orthologous regions from other taxonomic groups. AVAILABILITY Amplicon is freely available as a script that will run on any platform wi...
متن کاملHigh-quality sequence clustering guided by network topology and multiple alignment likelihood
MOTIVATION Proteins can be naturally classified into families of homologous sequences that derive from a common ancestor. The comparison of homologous sequences and the analysis of their phylogenetic relationships provide useful information regarding the function and evolution of genes. One important difficulty of clustering methods is to distinguish highly divergent homologous sequences from s...
متن کامل